Computation of term/document discrimination values by use of the cover coefficient concept

Authors

  • Fazli Can
  • Esen A. Ozkarahan
Abstract

Indexing in information retrieval (IR) is used to obtain a suitable vocabulary of index terms and an optimum assignment of these terms to documents, in order to increase the effectiveness and efficiency of an IR system. The term discrimination value (TDV) is one of the criteria used for index-term selection. In this article, a new concept called the cover coefficient (CC) is used to compute TDVs. After a brief introduction to the theory of indexing and the CC concept, the article discusses an efficient way of computing TDVs with the CC concept, index-term selection, and weight modification. It is also shown that the computational cost of the CC approach to calculating TDVs compares favorably with the cost of a different approach that uses similarity coefficients. Furthermore, the TDVs obtained by the CC approach are consistent with those of the latter approach.
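The abstract gives no formulas, so the following is only a minimal sketch of how a CC-based TDV could be computed, under assumptions that are not stated on this page. It assumes the formulation commonly associated with the cover coefficient concept: for an m-by-n document-by-term matrix D, the diagonal entry c_ii = alpha_i * sum_k(d_ik^2 * beta_k), where alpha_i and beta_k are the reciprocals of the row and column sums; the sum of the c_ii values estimates the number of clusters n_c; and the TDV of term k is taken as n_c minus the estimate obtained after deleting term k. The function names and the handling of empty documents are hypothetical choices made for illustration.

```python
def decoupling_coefficients(D):
    """Return the diagonal cover coefficients c_ii of a document-by-term matrix D.

    Assumed definition: c_ii = (1 / row_sum_i) * sum_k(d_ik**2 / col_sum_k).
    Documents with no terms are skipped (an illustrative choice).
    """
    m, n = len(D), len(D[0])
    row_sums = [sum(row) for row in D]
    col_sums = [sum(D[i][k] for i in range(m)) for k in range(n)]
    deltas = []
    for i in range(m):
        if row_sums[i] == 0:
            continue
        total = sum(D[i][k] ** 2 / col_sums[k] for k in range(n) if col_sums[k] > 0)
        deltas.append(total / row_sums[i])
    return deltas


def estimated_cluster_count(D):
    """Estimate the number of clusters as the sum of the diagonal coefficients."""
    return sum(decoupling_coefficients(D))


def term_discrimination_values(D):
    """Return one TDV per term: n_c with the term minus n_c without it."""
    base = estimated_cluster_count(D)
    tdvs = []
    for k in range(len(D[0])):
        # Delete column k and re-estimate the cluster count.
        reduced = [row[:k] + row[k + 1:] for row in D]
        tdvs.append(base - estimated_cluster_count(reduced))
    return tdvs


# Two documents: term 0 occurs in both, terms 1 and 2 occur in one each.
example = [[1, 1, 0],
           [1, 0, 1]]
tdv_example = term_discrimination_values(example)
```

In this toy matrix the term shared by both documents receives a negative value and the two document-specific terms receive positive values, which matches the usual reading of TDVs: a term that makes documents look alike is a poor discriminator. This direct version recomputes the estimate once per term; the efficient computation the abstract refers to is not reproduced here.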




Journal:
  • JASIS

Volume: 38

Publication date: 1987